Abstract

We present preliminary results from our on-going study: 
Comparing and optimizing source detection procedures for 
XMM images. By constructing realistic spatial and spec- 
tral source distributions and "observing" these through 
the XMM Science Simulator we study how source charac- 
teristics and instrumental effects influence detection. We 
are currently undertaking a statistical analysis on the out- 
come of running source detection algorithms on the simu- 
lated EPIC pn and MOS images in various energy bands. 
Particular emphasis is on the efficiency and reliability for 
detecting the faint, extended emission from distant clus- 
ters of galaxies. Here we present simulated EPIC images 
applicable to the XMM Large-scale Structure Survey (10 
ksec pointings of "blank fields"). However, we emphasize 
that our procedure is flexible, including as many realistic 
source characteristics and instrumental effects as possible, 
and yet modest in computational demand. As such it can 
be used for simulating XMM data obtained from virtually 
any X-ray source. 

All simulation products are made available at our website
(http://www.astro.ku.dk/~xcosmos/).
1. Introduction

One of the main goals of the XMM Large-Scale Structure
Survey (XMM LSS) (Pierre et al. 2001) is to map
the large-scale distribution of X-ray selected galaxy clus- 
ters out to a redshift of ~ 1 and to detect the richest 
clusters out to a redshift of ~ 2. Since the detection of 
faint, extended sources from X-ray data is a crucial part 
of the XMM LSS project, an efficient and reliable source 
detection method must be implemented for the survey. 



Previous investigations (Valtchanov et al. 2001; Refregier
et al. 2001) have invoked simplified source field setups and
left out some instrumental effects when studying the
efficiency of various detection algorithms on simulated XMM
images and estimating, e.g., completeness and confusion
limits. By using schematic as well as more realistic source
fields and by a more complete treatment of instrumental
effects through the XMM Science Simulator software SciSim
(which mimics the performance of XMM by ray-tracing incoming
photons through the mirror and detector systems), we extend
this earlier work and address additional questions, such as:

— What is the optimum energy band for distant cluster 
detection in XMM LSS data? 

— How do cluster X-ray morphology and off-axis distance
influence detection and photometric reconstruction of
source properties?

— How does one deal effectively with, e.g., blending of 
extended and pointlike sources and removal of bright 
point sources? 

Here we present a few simulated 10 ksec XMM EPIC 
images being used for comparing and developing source 
detection procedures for the XMM Large Scale Structure 
Survey. 

2. The Simulation Setup 

2.1. Source Fluxes 

Three types of objects are considered: type 1 AGN, type 2
AGN, and groups/clusters. Point source (i.e. AGN) fluxes and
object types are based on the X-ray Background (XRB)
synthesis model of Gilli et al. (2001), down to a limiting
flux of $1.3 \times 10^{[\ldots]}$ erg cm$^{-2}$ s$^{-1}$
(0.5-2 keV). This XRB model is consistent with the projected
cumulative number density $N$ of sources as a function of
X-ray flux $S$ (the log N-log S relation) as derived from
deep XMM (Hasinger et al. 2001) and Chandra (Tozzi et al.
2002) observations. For clusters and groups, fluxes are
provided by the assumed mass-temperature ($M - T$; see
below) and luminosity-temperature ($L_X - T$) relations down
to $5 \times 10^{-15}$ erg cm$^{-2}$ s$^{-1}$ (0.5-2 keV).

We have produced two sets of simulations, distinguished
only by the assumed cluster properties. In the first set,
clusters are drawn from standard Press-Schechter (P-S)
realizations of halos in a flat $\Lambda$CDM cosmology with
$\Omega_m = 0.3$, $\sigma_8 = 0.94$ and $H_0 = 65$ km
s$^{-1}$ Mpc$^{-1}$. They are extracted from 100 Monte Carlo
realizations of the redshift interval $z = 0 - 2$ down to a
mass floor of $10^{13}\,h_{100}^{-1}\,M_\odot$, assuming the
$M - T$ relation of Eke et al. (1996) and a non-evolving
(local) cluster $L_X - T$ relation of $L_X \propto T^3$
(Borgani et al. 1999) with no intrinsic dispersion. To
incorporate the observed $L_X - T$ dispersion as well as the
steepening of the $L_X - T$ relation at low $T$ ($< 1$ keV),
cluster fluxes are "corrected" for these effects by assuming
$L_X \propto T^{\alpha \pm \sigma}$, where $\sigma$ is the
standard deviation of a Gaussian distribution centered at
$\alpha$. For $T > 1$ keV, $\alpha = 3$ and $\sigma = 0.15$
are assumed (consistent with results from ROSAT cluster
surveys, cf. Borgani et al. 1999), while $\alpha = 5$ and
$\sigma = 1.0$ are taken for $T < 1$ keV, based on galaxy
group observations (Helsdon & Ponman 2000; Xue & Wu 2000).
The cluster flux limit of $5 \times 10^{-15}$ erg cm$^{-2}$
s$^{-1}$ is imposed hereafter.
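
The scatter prescription can be illustrated with a minimal Python sketch. The text gives no normalization for the relation, so `norm` and `pivot_kev` below are hypothetical placeholders, and the function names are ours. Assigning exactly $T = 1$ keV to the low-temperature branch is also an assumption.

```python
import random

def draw_lx_slope(temperature_kev, rng):
    """Draw the L_X-T slope from a Gaussian: N(3, 0.15) above 1 keV,
    N(5, 1.0) at or below 1 keV (the boundary choice is an assumption)."""
    if temperature_kev > 1.0:
        return rng.gauss(3.0, 0.15)
    return rng.gauss(5.0, 1.0)

def scattered_luminosity(temperature_kev, rng, norm=1.0, pivot_kev=1.0):
    """L_X = norm * (T / pivot)**slope with a randomly drawn slope.
    norm and pivot_kev are hypothetical; the text does not specify them."""
    slope = draw_lx_slope(temperature_kev, rng)
    return norm * (temperature_kev / pivot_kev) ** slope

rng = random.Random(42)
for t_kev in (0.5, 2.0, 5.0):
    print(t_kev, scattered_luminosity(t_kev, rng))
```

With a 1 keV pivot the two branches join continuously at $T = 1$ keV, which is one reason for that choice in this sketch.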

The second set of simulations is based on a cosmological
N-body simulation, from which five fields covering $2.9
\times 2.9$ deg$^2$ and the redshift interval $z < 1.4$ were
extracted. Except for $\sigma_8 = 1.0$, other parameters are
as above. Clusters are identified among the $\sim 1.3 \times
10^6$ dark matter particles in each field via a
friends-of-friends algorithm (where gravitationally unbound
particles are removed) and drawn from a random $30' \times
30'$ sight line through one of the fields. The $L_X - T$
relation, including scatter as above, is here assumed to
evolve so as to mimic no evolution in an ($\Omega_m = 1$,
$\Omega_\Lambda = 0$) universe, i.e. $L_X \propto T^3
(d_L/d_L^*)^2$, where $d_L$ is the luminosity distance in an
($\Omega_m = 0.3$, $\Omega_\Lambda = 0.7$) universe and
$d_L^*$ is the corresponding value in an ($\Omega_m = 1$,
$\Omega_\Lambda = 0.0$) cosmology.
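
As a rough illustration of the size of the evolution factor $(d_L/d_L^*)^2$, the sketch below integrates the luminosity distance numerically for the two flat cosmologies named above. The function names, the Simpson integration and the step count are our own choices, not part of the simulation pipeline. $H_0$ cancels in the ratio; 65 km s$^{-1}$ Mpc$^{-1}$ is used only to make the individual distances concrete.

```python
import math

C_KM_S = 299792.458  # speed of light in km/s

def lum_dist_flat(z, omega_m, h0=65.0, steps=2000):
    """Luminosity distance in Mpc for a flat universe
    (omega_lambda = 1 - omega_m), using Simpson's rule."""
    omega_l = 1.0 - omega_m

    def inv_e(zp):
        return 1.0 / math.sqrt(omega_m * (1.0 + zp) ** 3 + omega_l)

    h = z / steps  # steps must be even for Simpson's rule
    total = inv_e(0.0) + inv_e(z)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * inv_e(i * h)
    comoving = (C_KM_S / h0) * total * h / 3.0
    return (1.0 + z) * comoving

def evolution_factor(z):
    """(d_L / d_L*)^2 with d_L for Omega_m = 0.3 and d_L* for Omega_m = 1."""
    if z <= 0.0:
        return 1.0
    return (lum_dist_flat(z, 0.3) / lum_dist_flat(z, 1.0)) ** 2

for z in (0.1, 0.5, 1.0, 1.4):
    print(z, round(evolution_factor(z), 3))
```

The factor exceeds unity at all $z > 0$ and grows with redshift, because luminosity distances are larger in the $\Lambda$-dominated model.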

A low-redshift cut at z = 0.1 is imposed for compu- 
tational reasons, excluding ~ 2% and < 1% of the input 
clusters from the P-S and N-body calculations, respec- 
tively (i.e. in most LSS fields, such nearby groups/clusters 
will not appear). 

2.2. Spatial Flux Distributions 

AGN are assumed to be pointlike, whereas cluster surface
brightness distributions follow a 2-D $\beta$-profile of
core radius $r_c$. Since there is to some extent a
'degeneracy' in the observational determination of $\beta$
and $r_c$, we choose to fix $r_c$ and let only $\beta$ vary.
For each cluster, values of $\beta$ and the ratio $\eta$
between minor and major isophote axis are drawn from
observed distributions (Mohr et al. 1995), assuming these to
be Gaussians of mean 0.65 and $\sigma = 0.16$ ($\beta$), and
mean 0.80 and $\sigma = 0.12$ ($\eta$). $r_c$ is fixed at
1/4 of the cluster radius, which is determined through the
angular diameter distance relation assuming X-ray emission
out to $r_{500}$. The latter is directly provided by the
simulations in the N-body case, while it is calculated for
the P-S clusters in the point mass approximation.
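
For reference, the standard elliptical form of the $\beta$-profile is written out below. The text does not state the formula, so this is the conventional expression and we assume it is the one intended. Here $S_0$ is the central surface brightness, $x$ runs along the major axis and $y$ along the minor axis.

```latex
S(x, y) = S_0 \left[ 1 + \frac{x^2 + (y/\eta)^2}{r_c^2} \right]^{-3\beta + 1/2},
\qquad
\beta \sim \mathcal{N}(0.65,\, 0.16^2), \quad
\eta \sim \mathcal{N}(0.80,\, 0.12^2)
```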

Groups and clusters are each represented by an ensemble of
point sources spaced 2" apart, i.e. well below the on-axis
PSF of FWHM ~ 6" (the in-flight measured mean value for the
three X-ray telescopes at E = 1.5 keV; Jansen et al. 2001).
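
The discretization into point sources might be sketched as follows. The sketch assumes the conventional elliptical $\beta$-profile, $S \propto [1 + (x^2 + (y/\eta)^2)/r_c^2]^{-3\beta + 1/2}$, and truncates the emission at the cluster radius. The function names and the example cluster values are hypothetical.

```python
import math

def beta_profile(x, y, r_core, beta, eta):
    """Unnormalized elliptical beta-profile; x runs along the major axis."""
    r2 = x * x + (y / eta) ** 2
    return (1.0 + r2 / r_core ** 2) ** (-3.0 * beta + 0.5)

def cluster_as_point_sources(total_flux, r_cluster, beta, eta, spacing=2.0):
    """Return a list of (x, y, flux) point sources on a square grid.
    Positions and radii are in arcsec; r_core is fixed at r_cluster / 4
    and the fluxes are rescaled so that they sum to total_flux."""
    r_core = r_cluster / 4.0
    n = int(r_cluster // spacing)
    points = []
    for i in range(-n, n + 1):
        for j in range(-n, n + 1):
            x, y = i * spacing, j * spacing
            if x * x + (y / eta) ** 2 <= r_cluster ** 2:
                points.append((x, y, beta_profile(x, y, r_core, beta, eta)))
    norm = total_flux / sum(w for _, _, w in points)
    return [(x, y, w * norm) for x, y, w in points]

# Hypothetical example: a 60" cluster with the mean beta and eta values.
sources = cluster_as_point_sources(8e-14, 60.0, beta=0.65, eta=0.80)
print(len(sources), sum(f for _, _, f in sources))
```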



2.3. Source and Background Spectra 

All sources are assigned a spectrum, with normalizations
provided by the XRB model for point sources and by the
P-S/N-body simulations for clusters. The latter are assigned
an XSPEC-based MEKAL spectrum of metallicity $Z =
0.25\,Z_\odot$, with temperature and redshift provided by
the cosmological simulations. For the AGN, a power law
spectrum is assumed, i.e. $I(E) \propto E^{-\Gamma}$.
Consistent with observations (e.g., Page 1998), we choose
$\Gamma = 1.7$ for AGN1, while AGN2 are assigned a value
$\Gamma = 1.8$ and an intrinsic H I column density of
$10^{22}$ cm$^{-2}$. Given the spectral normalization in one
band of the XRB model (e.g. 0.5-2 keV), this choice of
spectra approximately reproduces the point source density
predicted in the other model bands (2-10 and 5-10 keV).
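
To show how a power law ties the flux in one band to that in another, the sketch below computes unabsorbed band-flux ratios. It assumes that $\Gamma$ is a photon index, so that the energy flux density scales as $E^{1-\Gamma}$. It also ignores both the intrinsic and the Galactic absorption, which means it is only indicative for AGN2. The function names are ours.

```python
import math

def band_energy_flux(norm, gamma, e_lo, e_hi):
    """Integral of E * norm * E**(-gamma) over [e_lo, e_hi] in keV,
    where gamma is taken to be the photon index (an assumption)."""
    if abs(gamma - 2.0) < 1e-12:
        return norm * math.log(e_hi / e_lo)
    p = 2.0 - gamma
    return norm * (e_hi ** p - e_lo ** p) / p

def band_ratio(gamma, soft=(0.5, 2.0), hard=(2.0, 10.0)):
    """Ratio of hard-band to soft-band energy flux for a pure power law."""
    return band_energy_flux(1.0, gamma, *hard) / band_energy_flux(1.0, gamma, *soft)

for gamma in (1.7, 1.8):
    print(gamma, band_ratio(gamma), band_ratio(gamma, hard=(5.0, 10.0)))
```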

A uniform diffuse X-ray background is added, modeled as a
sum of two power laws. From fits to XRB observations we
choose $\Gamma = 1.4$ and a normalization $A$ at $E = 1$ keV
of 9.0 keV cm$^{-2}$ s$^{-1}$ sr$^{-1}$ keV$^{-1}$ for the
power law dominating the XRB spectrum above $E \sim 1$ keV.
This effectively acts in the simulations as the integrated
contribution of sources fainter than our adopted flux limit.
A second power law of $\Gamma = 3.0$ and $A = 1.0$ is also
included, to account for the observed excess of emission at
low energies (e.g., Miyaji et al. 1998; Parmar et al. 1999)
compared to that expected from the $\Gamma = 1.4$ power law.
Adding this component also ensures an overall 1 keV
normalization of 10.0 keV cm$^{-2}$ s$^{-1}$ sr$^{-1}$
keV$^{-1}$, consistent with most XRB measurements (e.g.,
Chen et al. 1997).
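
The two-component background can be written down directly, as in the sketch below. One detail is not fixed by the text: whether $\Gamma$ multiplies the photon spectrum or the energy spectrum. The hypothetical `index_offset` parameter switches between the two readings (1.0 for photon indices, 0.0 for energy-spectrum slopes). The 1 keV normalization and the energy at which the two components are equal do not depend on this choice.

```python
def xrb_components(e_kev, index_offset=1.0):
    """Return (hard, soft) intensities in keV cm^-2 s^-1 sr^-1 keV^-1.
    index_offset=1.0 treats 1.4 and 3.0 as photon indices (assumption)."""
    hard = 9.0 * e_kev ** -(1.4 - index_offset)
    soft = 1.0 * e_kev ** -(3.0 - index_offset)
    return hard, soft

def xrb_intensity(e_kev, index_offset=1.0):
    """Total diffuse background intensity at energy e_kev (keV)."""
    return sum(xrb_components(e_kev, index_offset))

def crossover_energy():
    """Energy (keV) where both components are equal: 9 = E**(-1.6)."""
    return 9.0 ** (-1.0 / 1.6)

print(xrb_intensity(1.0), crossover_energy())
```

The soft component only overtakes the hard one well below 0.5 keV, so its effect in the 0.5-2 keV band is a modest steepening rather than a dominant contribution.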

Both individual sources and the X-ray background are
subjected to a uniform absorbing column density of $3 \times
10^{20}$ cm$^{-2}$, which is the mean value expected for the
XMM LSS 10 ksec pointings. On top of this the in-orbit
measured particle and internal background is added, using
normalizations as given in the XMM-Newton Users'
Handbook$^1$ and spectra roughly modeled as power laws
including Gaussian representations of the four and two most
prominent lines for the pn and MOS cameras, respectively
(see also Katayama et al. 2002; a more detailed background
model is soon to be implemented). This background component
is not subject to mirror vignetting and hence not included
in the input to SciSim.



3. Data "Reduction" 

Simulated data are created for the EPIC MOS and pn CCDs
(with thin filters) using an exposure time of 10 ksec, with
all CCDs in full frame mode. Spacecraft effects such as
drift and jitter are neglected. The output of SciSim is, in
accordance with real XMM data, in the form of Observation
Data Files (ODFs), which include photon event files for each
detector and CCD chip. Images and exposure maps are created
from the event files using the XMM Science Analysis System
(XMM-SAS).

For illustration purposes a source field image is pro- 
duced for each simulation. These are convolved with a 6" 
Gaussian and have image borders conforming to the field 
of view covered by the EPIC pn CCDs.
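
The smoothing of the source field image could look like the sketch below, which applies a separable Gaussian kernel in pure Python. Two assumptions are made here: that the quoted 6" refers to the FWHM of the Gaussian, and that the image has 4" pixels (a hypothetical value). Pixels outside the image are treated as zero.

```python
import math

FWHM_TO_SIGMA = 1.0 / (2.0 * math.sqrt(2.0 * math.log(2.0)))

def gaussian_kernel(sigma_pix):
    """Normalized 1-D Gaussian kernel truncated at about 4 sigma."""
    half = max(1, int(math.ceil(4.0 * sigma_pix)))
    weights = [math.exp(-0.5 * (k / sigma_pix) ** 2) for k in range(-half, half + 1)]
    total = sum(weights)
    return [w / total for w in weights]

def smooth_rows(image, kernel):
    """Convolve every row with the kernel, zero-padding beyond the edges."""
    half = len(kernel) // 2
    out = []
    for row in image:
        n = len(row)
        new_row = []
        for i in range(n):
            acc = 0.0
            for k, w in enumerate(kernel):
                j = i + k - half
                if 0 <= j < n:
                    acc += w * row[j]
            new_row.append(acc)
        out.append(new_row)
    return out

def transpose(image):
    return [list(col) for col in zip(*image)]

def smooth_image(image, fwhm_arcsec=6.0, pixel_arcsec=4.0):
    """Smooth a 2-D list-of-lists image with a circular Gaussian."""
    kernel = gaussian_kernel(fwhm_arcsec * FWHM_TO_SIGMA / pixel_arcsec)
    rows_done = smooth_rows(image, kernel)
    return transpose(smooth_rows(transpose(rows_done), kernel))

# A single point source of 100 counts in the middle of a 21 x 21 image.
field = [[0.0] * 21 for _ in range(21)]
field[10][10] = 100.0
smoothed = smooth_image(field)
print(round(smoothed[10][10], 2), round(sum(map(sum, smoothed)), 6))
```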



1 http://xmm.vilspa.esa.es/user/AQ2/uhb/xmm_uhb.htm



Figure 1. Simulated 0.5-2 keV image (photon counts for
pn + 2 MOS; logarithmic intensity scale). Spatial scale is
28' x 28'.



4. Simulation Results and Source Detection 

In Figures 1 and 2 we present examples of simulated EPIC
images. These are "raw" 10 ksec photon count images,
produced using the latest version (v3.0.0) of SciSim with
the XMM Current Calibration Files as of Dec 11, 2001, and
with the P-S prescription for extended sources (§2.1). For
comparison, the corresponding input source field is shown in
Figure 3. In this particular representation, most clusters
are "group-like", with only two of the 12 input clusters
having temperatures $T > 3$ keV. The brightest one, a $T =
3.3$ keV cluster at $z = 0.35$ with a 0.5-2 keV flux of $8
\times 10^{-14}$ erg cm$^{-2}$ s$^{-1}$, stands out in the
lower left corner.

Figure 2. As Fig. 1, but for the 2-10 keV band.

Based on the simulated images, we have done preliminary
tests of the performance of the wavelet-based XMM-SAS source
detection task ewavelet, using wavelet scales of 2, 4, 8,
16, and 32 pixels, and a detection significance threshold of
$6\sigma$. This method was chosen as a starting point
because it was demonstrated to work reasonably well for the
simulated XMM images of Valtchanov et al. (2001), and in
this case, loosely judged, achieved optimum performance with
the above parameters. A result of running this specific
setup of ewavelet on the 0.5-2 keV image of Fig. 1 is
illustrated in Figure 4, showing the wavelet-reconstructed
image. Inspection of this image readily shows that four
extended sources have been "detected". Comparing the result
with the input source field in Fig. 3 reveals that one of
these extended sources (the most central one) is actually
two point sources blended together, whereas another results
from the blending of two extended sources (this is not
surprising, as one of these blended clusters is quite faint,
being just above the adopted input flux limit of $5 \times
10^{-15}$ erg cm$^{-2}$ s$^{-1}$). However, the brightest
cluster mentioned above is safely detected, and a relatively
faint ($7 \times 10^{-15}$ erg cm$^{-2}$ s$^{-1}$), $T = 3$
keV cluster at $z = 1.01$ is also found. The remaining
extended sources are apparently either too faint, too far
off-axis (clearly the case for at least five of them), or
suffer too much blending with bright point sources to be
detected in this case. We further note in passing that the
number of point sources detected roughly corresponds to half
the input number, and this will certainly improve when
including the results based on the 2-10 keV image.
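
A comparison between detected sources and the input source field, as described above, can be made quantitative by cross-matching the two position lists. The sketch below uses a greedy nearest-neighbour match with a hypothetical matching radius. It only illustrates the bookkeeping and is not the procedure used for the results quoted here. All coordinates in the example are invented.

```python
import math

def cross_match(inputs, detections, radius):
    """Greedy nearest-neighbour matching of (x, y) positions.
    Each input source and each detection is used at most once."""
    pairs = []
    for d_idx, (dx, dy) in enumerate(detections):
        for i_idx, (ix, iy) in enumerate(inputs):
            sep = math.hypot(dx - ix, dy - iy)
            if sep <= radius:
                pairs.append((sep, i_idx, d_idx))
    pairs.sort()  # closest pairs are assigned first
    used_inputs, used_detections, matches = set(), set(), []
    for sep, i_idx, d_idx in pairs:
        if i_idx not in used_inputs and d_idx not in used_detections:
            used_inputs.add(i_idx)
            used_detections.add(d_idx)
            matches.append((i_idx, d_idx))
    return matches

def completeness(inputs, detections, radius):
    """Fraction of input sources with a matched detection."""
    return len(cross_match(inputs, detections, radius)) / len(inputs)

# Invented positions in arcsec: two recovered sources, two missed, one spurious.
input_sources = [(0.0, 0.0), (10.0, 0.0), (50.0, 50.0), (80.0, 80.0)]
detected = [(1.0, 0.0), (9.0, 1.0), (200.0, 200.0)]
print(completeness(input_sources, detected, radius=3.0))
```

A blend of two input sources into one detection counts as a single match in this scheme, so blending lowers the measured completeness.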



5. Outlook 

The simulated 10 ksec EPIC images, examples of which 
have been shown in this paper, will provide a baseline 
for testing and optimizing cluster detection procedures 
for the XMM Large Scale Structure Survey. This will, 
among other things, allow us to firmly address the ques- 
tions posed in the introduction. 

The detection of the $z \sim 1.0$ cluster described above
seems particularly promising with respect to achieving our
goal of finding high-redshift galaxy clusters, as it
suggests that at least certain reasonably massive but
distant ($z > 1$) clusters could be well within the reach of
detection in the XMM LSS Survey (and other blank field XMM
exposures of similar depth). It should also be emphasized in
this context that work still remains in order to optimize
the method described here for the specific case of our
simulated images.

Figure 3. Input source field, corresponding to the simulated
images shown in Figures 1 and 2.

A comprehensive analysis involving additional detection
algorithms is currently being carried out, and we are also
testing how selected methods for image restoration
(including the publicly available version of the pixon code;
http://www.pixon.com) applied prior to source detection may
improve detection performance. Further plans for the near
future include testing source detection on deeper simulated
exposures and studying how combining partially overlapping
exposures will enhance detection. Results of this work will
be presented in a forthcoming paper.
Figure 4. Wavelet reconstruction of the image shown in
Fig. 1, illustrating the sources detected above $6\sigma$
significance.